SnIPRE: Selection Inference Using a Poisson Random Effects Model

نویسندگان

  • Kirsten E. Eilertson
  • James G. Booth
  • Carlos D. Bustamante
چکیده

We present an approach for identifying genes under natural selection using polymorphism and divergence data from synonymous and non-synonymous sites within genes. A generalized linear mixed model is used to model the genome-wide variability among categories of mutations and estimate its functional consequence. We demonstrate how the model's estimated fixed and random effects can be used to identify genes under selection. The parameter estimates from our generalized linear model can be transformed to yield population genetic parameter estimates for quantities including the average selection coefficient for new mutations at a locus, the synonymous and non-synynomous mutation rates, and species divergence times. Furthermore, our approach incorporates stochastic variation due to the evolutionary process and can be fit using standard statistical software. The model is fit in both the empirical Bayes and Bayesian settings using the lme4 package in R, and Markov chain Monte Carlo methods in WinBUGS. Using simulated data we compare our method to existing approaches for detecting genes under selection: the McDonald-Kreitman test, and two versions of the Poisson random field based method MKprf. Overall, we find our method universally outperforms existing methods for detecting genes subject to selection using polymorphism and divergence data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A note on conditional Akaike information for Poisson regression with random effects

A popular model selection approach for generalized linear mixed-effects models is the Akaike information criterion, or AIC. Among others, [5] pointed out the distinction between the marginal and conditional inference depending on the focus of research. The conditional AIC was derived for the linear mixed-effects model which was later generalized by [4]. We show that the similar strategy extends...

متن کامل

Time-Dependent Random Effect Poisson Random Field Model for Polymorphism within and Between Two Related Species

TIME-DEPENDENT RANDOM EFFECT POISSON RANDOM FIELD MODEL FOR POLYMORPHISM WITHIN AND BETWEEN TWO RELATED SPECIES by Shilei Zhou Dr. Amei Amei, Examination Committee Chair Associate Professor of Mathematical Sciences University of Nevada Las Vegas Molecular evolution is partially driven by mutation, selection, random genetic drift, or combination of the three factors. To quantify the magnitude of...

متن کامل

Statistical Inference of Selection and Divergence from a Time-Dependent Poisson Random Field Model

We apply a recently developed time-dependent Poisson random field model to aligned DNA sequences from two related biological species to estimate selection coefficients and divergence time. We use Markov chain Monte Carlo methods to estimate species divergence time and selection coefficients for each locus. The model assumes that the selective effects of non-synonymous mutations are normally dis...

متن کامل

Bayesian paradigm for analysing count data in longitudina studies using Poisson-generalized log-gamma model

In analyzing longitudinal data with counted responses, normal distribution is usually used for distribution of the random efffects. However, in some applications random effects may not be normally distributed. Misspecification of this distribution may cause reduction of efficiency of estimators. In this paper, a generalized log-gamma distribution is used for the random effects which includes th...

متن کامل

Exact Statistical Inference for Some Parametric Nonhomogeneous Poisson Processes

Nonhomogeneous Poisson processes (NHPPs) are often used to model recurrent events, and there is thus a need to check model fit for such models. We study the problem of obtaining exact goodness-of-fit tests for certain parametric NHPPs, using a method based on Monte Carlo simulation conditional on sufficient statistics. A closely related way of obtaining exact confidence intervals in parametri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2012